Lecture 5: Clustering and Adaptation
Abstract
The state index should encode all context information that might influence the acoustics: the third state of the /t/ in “a tree” should be different from the third state of the /t/ in “a tip,” because they are acoustically different. Likewise, lexical stress, phrase position, glottalization, and dialect might matter. Unfortunately, we never have enough training examples to learn the likelihood function associated with every possible combination of context variables. Therefore, we use a three-step strategy: (1) learn the mean and variance of the MFCCs associated with every combination of context variables available in the training database, (2) cluster the context vectors, using acoustic similarity as the metric and ensuring that each clustered phone has an adequate number of training examples, (3) learn a complete Gaussian mixture PDF for each clustered context-dependent phone. There are two ways in which similarity-based clustering can be performed in HTK:
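The three-step strategy can be illustrated with a small sketch. This is not either of the HTK procedures the abstract refers to. It is a toy bottom-up merge, written under several assumptions: the context labels and two-dimensional "MFCC" vectors are invented, the variance floor and the variance-scaled distance are illustrative choices, and step (3) is reduced to re-estimating a single diagonal Gaussian per cluster where a real system would train a full mixture.

```python
from statistics import fmean, pvariance

VAR_FLOOR = 1e-3  # assumed floor so single-frame contexts keep a usable variance


def gaussian_stats(frames):
    """Step 1: per-dimension mean and variance of a list of feature vectors."""
    dims = list(zip(*frames))
    means = [fmean(d) for d in dims]
    variances = [max(pvariance(d), VAR_FLOOR) for d in dims]
    return means, variances


def distance(frames_a, frames_b):
    """Squared distance between means, scaled by the summed variances."""
    mean_a, var_a = gaussian_stats(frames_a)
    mean_b, var_b = gaussian_stats(frames_b)
    return sum((ma - mb) ** 2 / (va + vb)
               for ma, mb, va, vb in zip(mean_a, mean_b, var_a, var_b))


def cluster_contexts(data, min_frames):
    """Step 2: merge under-trained contexts into their nearest neighbour.

    data maps a context label to its list of feature vectors.
    Returns a list of (labels, frames) clusters.
    """
    clusters = [([label], list(frames)) for label, frames in data.items()]
    while len(clusters) > 1:
        smallest = min(clusters, key=lambda c: len(c[1]))
        if len(smallest[1]) >= min_frames:
            break  # every cluster now has enough training examples
        others = [c for c in clusters if c is not smallest]
        nearest = min(others, key=lambda c: distance(smallest[1], c[1]))
        merged = (smallest[0] + nearest[0], smallest[1] + nearest[1])
        clusters = [c for c in others if c is not nearest] + [merged]
    return clusters


# Hypothetical contexts for /t/, each with a few toy 2-D feature vectors.
toy = {
    "t_before_r": [[1.0, 2.0], [1.2, 2.1], [0.9, 1.9]],
    "t_before_i": [[5.0, 6.0], [5.1, 6.2], [4.9, 5.8]],
    "t_before_r_stressed": [[1.1, 2.0]],  # too few frames to model alone
}

clusters = cluster_contexts(toy, min_frames=3)

# Step 3 (stand-in): re-estimate one Gaussian per clustered context.
models = {tuple(labels): gaussian_stats(frames) for labels, frames in clusters}
```

With this toy data, the single-frame stressed context lies much closer to `t_before_r` than to `t_before_i`, so it is absorbed into the former and the two remaining clusters each meet the `min_frames` requirement.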
Publication date: 2009